Web Science: The Digital-Heritage Case

نویسنده

  • Guus Schreiber
چکیده

Web Science studies the interpay between web technology and the human behaviour it induces at the micro, meso and macro level. In this extended abstract we examine Web Science research issues by taking a closer look at the area of digital heritage. We discuss engineering, communication and socio-economic aspects. 1 What is Web Science? Over the past 15 years the Web has had an increasing impact on the life of people. Web technology has changed the way people operate and communicate, both in their personal and working lives. In many ways the Web is a new phenomenon, for which the principles of the physical world do not always hold. Web science is a new scientific discipline that studies this phenomenon, in particular the interplay between Web technology and the effect it has on human behavior at the personal organizational and societal level. Good introductions into Web Science can be found in the articles of BernersLee et al. [1] and Shneiderman [2]. The program and proceeddings of the 1st Web Science Conference in Athens give a good impression of the field. In this extended abstract we illustrate research issues in Web Science by looking at one particular area, namely the digital heritage domain. Digital heritage comprises access to and interaction with large-scale virtual cultural heritage collections. We look at this domain from the experiences gained in a series of digital heritage projects, such as the ongoing work on the Europeana culture portal Europeana. We limit the discussion here to engineering, communication and socio-econmic aspects of digital heritage. 2 Engineering of digital-heritage collections Cultural heritage is an extremely knowledge-rich domain. Institutions in this field have been gathering knowledge for decades or even centuries. This knowledge is gathered in the form of a multitude of vocabularies, thesauri, classification schemes and other knowledge organization systems, which are used to describe 1 http://www.websci09.org/ 2 http://www.europeana.eu heritage objects, such as paintings, books and archival documents. These knowledge organization systems display a enormous richness, but are seldomly described in the formal way favored by computer scientists. The biggest challenge in digital-heritage ventures, such as Europeana, lies in interoperability. The description of heritage objects inherently shows a large variety of perspectives, caused by differences in type of object, time, place, culture and language. When constructing a web portal for cultural heritage, such as Europeana or E-Culture[3], we are faced with a number of research questions. Firstly, we have to provide mechanisms for explicating heritage knowledge in a machine-readable web format. SKOS is a recently released web standard for achieving this. The design of SKOS reflects some important principles, in particular the principle of “minimal ontological commitment”: schema’s for publishing knowledge on the Web need to be restricted to the minimum level of required constructs and semantic constraints for these to be usable across the field. For computer scientists with their formal background this is often counterintuitive. Secondly, to enable collection interoperability we need to provide techniques for partial alignments between knowledge organization systems. In other words: unification is infeasible in diverse domains such as cultural heritage; the best we can do is uncover the agreement and overlap that does exist. For this reason vocabulary-alignment techniques have become an active area in web research. Thirdly, due to the fact that cultural-heritage descriptions often partially consist of textual descriptions we also require a range of knowledge-extraction techniques, such as from naturallanguage processing and machine learning. Finally, we have to deal with large amounts of data, typically billions of statements (“the web of data”). This requires scalable search techniques. Given such amounts of data, the traditional notions of correctness and completeness make no sense. Therefore, alternative approaches to reasoning in a web of data are an active area of research. Engineering web data in other domains, such as health care and biology leads to similar research issues. A pervasive common issue is also the notion of web identity. What should be the URI for Pablo Picasso or for the European union? Despite the many research efforts in this area, this is still an open research problem. It is likely that solutions will require some form of societal consensus. 3 Communication in digital heritage An often-heard opinion in the cultural-heritage field is that the digital experience can never replace the “real thing”. In this view digital access is is a surrogate and should ideally be a teaser for the web visitor to come to the museum. This is a lopsided view: the virtual world provides us with alternative and complementary forms of interaction with cultural heritage. An example is the generation of personalized museum tours [4]. Such tours can be first generated for a digital experience and then be downloaded on a mobile device for a physical tour in a museum. The physical tour has limitation ins time and space; the virtual tour 3 http://www.w3.org/2004/02/skos/ 4 http://www.larkc.eu has limitations in freedom of experience. It should also be noted that digital techniques help in accessing objects that would otherwise be inaccessible for conservation reasons (e.g. old manuscripts). Thus the virtual and physical visit both have their own pros and cons. Social tagging is receiving considerable attention in the museum world (see for example the Steve Museum). Tagging is a way to involve web visitors more actively in the collection. Also, given the large amounts of poorly-described objects the heritage institutions are keen on using web visitors for creating object metadata. This raises two issues. Firstly, institutions have to come up with incentive schemes for web visitors to help annotating the objects. Secondly, quality is seen as a key characteristic of curated metdata and is not yet clear which strategies should be followed in quality control of non-curated metadata. This is now an active area of research. Central in web communication is the issue of user identity and user profiling. Current practice is that user identity is mostly handled at the level of individual web sites or applications. For web visitors this means they have to recreate their identity and their profile in many different places. The control of users over their own profile is limited, as it is usually stored in application-specific cookies. There are several proposals to “put the web visitor back in the driver seat”, such as FOAF, OpenID and OpenSocial. We expect these mechanisms to change the scenery significantly over the next few years. 4 Social and economic issues in digital heritage Although heritage institutions, at least in Europe, are almost all public institutions funded with public money, this does not mean that open access to culturalheritage data should be taken for granted. Projects like Europeana face social barriers in it strive for open access. This is understandable: the heritage institutions have built up their knowledge and data over a long time with an eye on quality control, and are anxious to make this available with the risk of it being used or interpreted in the wrong way. For open access to be become the norm instead of the exception leaders in the field have to set the example. In the library world Library of Congress has done this by making their Subject Headings publicly available in a SKOS format. Major European national libraries, musea and archives are now doing the same within Europeana. Access to object data, such as images of paintings, still gives rise to rights and authority issues. New license schemes are required. Creative Commons is frequently mentioned in this context, but is not really tailored to the type of heritage field. Similar schemes, but targeted specifically at data collections, have 5 http://www.steve.museum/ 6 http://www.foaf-project.org/ 7 http://openid.net/ 8 http://code.google.com/apis/opensocial/ 9 http://id.loc.gov/authorities/ 10 http://creativecommons.org/ been proposed. Open Data Commons appears to be a promising candidate for deployment in the area. The business models for digital heritage depend to some extent on the rights issue. A typical web business model would assume that all primary access to the collections is free, including low and medium-resolution images. Secondary services can be profit-based, such as access to high-resolution images, use of objects within a virtual museum shop (e.g., posters, clothing) and integration with tourist services (e.g. combining heritage access with city walks).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sociological Impact of Using Digital (Web-based) Analyses on Performance Measurement and Optimization of Digital Marketing among Young Managers (Case study: Digital-based Companies in Tehran)

This research aims to study the effect of using digital (web-based) analyses in performance measurement and optimization of digital marketing in digital-based companies in Tehran. The data collection tool was a researcher-made questionnaire. A panel of experts and supervisor were asked to measure the validity of the questionnaire. For reliability analysis of this tool, Cronbach’s alpha test was...

متن کامل

The aggregation of heterogeneous metadata in web-based cultural heritage collections: a case study

In this paper we discuss some issues related to digital libraries that are born as aggregators of cultural heritage resources harvested from different distributed digital repositories. Whenever such aggregations fall short of providing a real service to users, most of the time this is due to the inadequacy of the data (or conceptual) model proposed for the mapping and exposed to the final user....

متن کامل

Project "The Digital City Revives" A Case Study of Web Archaeology

Twenty-two years ago a city emerged from computers, modems and telephone cables. On 15 January 1994 De Digitale Stad (DDS; The Digital City) opened its virtual gates in Amsterdam. DDS, the first virtual city in the world, and made the internet (free) accessible for the first time to the general public in the Netherlands. But like many other cities in the world history, this city disappeared. In...

متن کامل

Reanimating cultural heritage through digital technologies

Digital technologies are becoming extremely important for web-based cultural heritage applications. This thesis presents novel digital technology solutions to ̳access and interact‘ with digital heritage objects and collections. These innovative solutions utilize service orientation (web services), workflows, and social networking and Web 2.0 mashup technologies to innovate the creation, interpr...

متن کامل

Crowdsourcing Digital Cultural Heritage

With the turn towards the digital age, a growing number of institutions in the GLAM (Galleries, Archives, Museums and Libraries) sector started to identify a need for digitising their different collections placing them online with goals to preserve and exhibit them in the digital environment. After the initial efforts to develop policies, methodologies and best practices in transferring the col...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010